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Abstract 

The Integer Programming Problem (IP) for a polytope P C R n is to find an integer point in P or 
decide that P is integer free. We give a randomized algorithm for an approximate version of this problem, 
C*~) . which correctly decides whether P contains an integer point or whether a (1 + e) scaling of P around its 

barycenter is integer free in time 0(1/ e 2 )" with overwhelming probability. We reduce this approximate 
£^ . IP question to an approximate Closest Vector Problem (CVP) in a "near-symmetric" semi-norm, which 

^**^ ' we solve via a randomized sieving technique first developed by Ajtai, Kumar, and Sivakumar (STOC 

2001). Our main technical contribution is an extension of the AKS sieving technique which works for 
q ' any near-symmetric semi-norm. Our results also extend to general convex bodies and lattices. 

Keywords. Integer Programming, Shortest Vector Problem, Closest Vector Problem. 
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The Integer Programming (IP) Problem, i.e. the problem of deciding whether a polytope contains an in- 
teger point, is a classic problem in Operations Research and Computer Science. Algorithms for IP were 
first developed in the 1950s when Gomory IIGom581 gave a finite cutting plane algorithm to solve general 
(Mixed)-Integer Programs. However, the first algorithms with complexity guarantees (i.e. better than finite- 
ness) came much later. The first such algorithm was the breakthrough result of Lentra MLen831 . which gave 
the first fixed dimension polynomial time algorithm for IP Lenstra's approach revolved on finding "flat" 
integer directions of a polytope, and achieved a leading complexity term of 2°( n ) where n is the num- 
ber of variables. Lenstra's approach was generalized and substantially improved upon by Kannan MKan87l . 
who decreased the complexity to 0(n 2,5 ) n . Recently, Dadush et al MDPV111 improved this complexity to 

4 

0(na) n by using a solver for the Shortest Vector Problem (SVP) in general norms. Following the works 
of Lenstra and Kannan, fixed dimension polynomial algorithms were discovered for counting the number 
of integer points in a rational polyhedron MBar941 . parametric integer programming MKan901 IES08B . and 
integer optimization over quasi-convex polynomials IIHei051 IHKIOII . However, over the last twenty years 
the known algorithmic complexity of IP has only modestly decreased. A central open problem in the area 
therefore remains the following: 
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Question: Does there exist a 2°( n )-time algorithm for Integer Programming? 

In this paper, we show that the answer to this question is affirmative as long as we are willing to accept 
an approximate notion of containment. More precisely, we give a randomized algorithm which can correctly 
distinguish whether a polytope P contains an integer point or if a small "blowup" of P contains no integer 
points in 0(l/e 2 ) n time with overwhelming probability. Our results naturally extend to setting of general 
convex bodies and lattices, where the IP problem in this context is to decide for a convex body K and 
lattice L in R" whether K n L = 0. To obtain the approximate IP result, we reduce the problem to a 
(1 + e)-approximate Closest Vector Problem (CVP) under a "near-symmetric" semi-norm. 

Given a lattice L C R™ (integer combinations of a basis b\, . . . , b n G W 1 ) the SVPis to find min^g^x r i \\y 
and given x G R n the CVP is to find min^L ||y — x\\, where || • || is a given (semi-)norm. A semi-norm || • || 
satisfies all norm properties except symmetry, i.e. we allow 7^ || — x\\. 

Our methods in this setting are based on a randomized sieving technique first developed by Ajtai, Kumar 
and Sivakumar HAKS01I IAKS021 for solving the Shortest (SVP) and Closest Vector Problem (CVP). In 
HAKS01II . they give a 2°^ sieving algorithm for SVP in the £2 norm, extending this in HAKS021 to give 
a 2°(~)-time algorithm for (1 + e)-CVP in the £2 norm. In MBN0711 . a sieve based 2°( n )-time algorithm 
for SVP and (9(l/e 2 ) n -time algorithm for (1 + e)-CVP in any £ p norm is given. In [AJ08J, the previous 
results are extended to give a 2°( n )-time SVP algorithm in any norm (though not semi-norm). In MEHN11II . a 
technique to boost any 2-approximation algorithm for CVP is given which yields a 0(ln(^)) n algorithm 
for (1 + e)-CVP under £00. Our main technical contribution is an extension of the AKS sieving technique to 
give a 0(l/e 2 ) n algorithm for CVP under any near-symmetric semi-norm. 

1.1 Definitions 

In what follows, K C R n will denote a convex body (a full dimensional compact convex set) and L C R n 
will denote an n-dimensional lattice (all integer combinations of a basis of R n ). K will be presented by 
a membership oracle in the standard way (see section [2]), and L will be presented by a generating basis 
G W 1 . We define the barycenter (or centroid) of K as b{K) = vo ^ K -j f K xdx. 

For sets A, B C R n and scalars s, t € R define the Minkowski Sum sA+tB = {sa + tb : a G A,b G B}. 
'mt(A) denotes the interior of the set A. 

Let C C R n be a convex body where G int(C). Define the semi-norm induced by C (or gauge function 
of C) as ||x||c = inf{s > : x G sC} for x G R n . || • \\c satisfies all norm properties except symmetry, i.e. 
\\ x \\c / II — x \\c is allowed. || • ||c (or C) is 7-symmetric, for < 7 < 1, if vol(C n — C) > 7™ vol(C). 
Note C is 1-symmetric iff C = —C. 

For a lattice L and semi-norm || • \\c, define the first minimum of L under || • \\c as \\(C, L) = 
m fzgL\{o} \\ z \\c (length of shortest non-zero vector). For a target x, lattice L, and semi-norm || • ||c, define 
the distance from x to L under || • \\c as dc(L, x) = inf zg £ \\z — x\\c- 

1.2 Results 

We state our main result in terms of general convex bodies and lattices. We recover the standard integer 
programming setting by setting L = Z n , the standard integer lattice, and K = {x G R n : Ax < b}, a general 
polytope. For simplicity, we often omit standard polynomial factors from the runtimes of our algorithms (i.e. 
polylog terms associated with bounds on K or the bit length of the basis for L). 
Our main result is the following: 
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Theorem 1.1 (Approximate IP Feasibility). For < e < \, there exists a 0(l/e 2 ) n time algorithm which 
with probability at least 1 — 2~ n either outputs a point 

ye(l + e)K- eb(K) n L 

or decides that K n L = 0. Furthemore, if 



1+e 1+e 

the algorithm returns a point z £ K D L with probability at least 1 — 2~ n . 

The above theorem substantially improves the complexity of IP in the case where K contains a "deep" 
lattice point (i.e. within a slight scaling of K around its barycenter). Compared to exact algorithms, our 
methods are competitive or faster as long as 

1 n" 1 / 2 

;K + ———b(K)CM^%, 



l + n -i/2 l + n-Va 

where we achieve complexity 0(n) n (which is the conjectured complexity of the IP algorithm in MDPV1110 . 
Hence to improve the complexity of IP below 0(n s ) n , for any < 5 < 1, one may assume that all the 
integer points lie close to the boundary, i.e. that 

1 n~^ S 
—K + -UK) n L = 0. 



l + n~2 5 i +n ~2 S 

The above statement lends credence to the intuition that exact IP is hard because of lattice points lying very 
near the boundary. 

Starting with the above algorithm, we can use a binary search procedure to go from approximate feasi- 
bility to approximate optimization. This yields the following theorem: 

Theorem 1.2 (Approximate Integer Optimization). For v G W 1 , < e < \, 5 > 0, there exists a 
0(l/e 2 ) n polylog(|, ||f H2) algorithm which with probability at least 1 — 2~ n either outputs a point 

y £ K + e(K - K) n L 

such that 



sup (v, z) < (v, y) + 5 

zeKnL 



or correctly decides that K Pi L 



The above theorem states that if we wish to optimize over K n L, we can find a lattice point in a slight 
blowup of K whose objective value is essentially as good as any point in K n L. We remark that the 
blowup is worse than in Theorem ll.il since (1 + e)K — ex C K + e{K — K) for any x € K . This stems 
from the need to call the feasibility algorithm on multiple restrictions of K. To give a clearer understanding 
of this notion, the new constraints of the "blowup" body can be understood from the following formula: 



sup (v, x) = sup (v, x) + e I sup (v, x) — inf (v, x) 

x£K+e(K-K) x£K \x&K X ^ K 

Hence each valid constraint (v , x) < c for K, is relaxed by an e-fraction of its v's variation over K. 
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1.3 Main Tool 



We now describe the main tool used to derive both of the above algorithms. At the heart of Theorem ll.il is 
the following algorithm: 

Theorem 1.3. Let \\-\\c denote a ^-symmetric semi-norm. For x G W 1 , < e < i, there exists an 0(^^) n 
time algorithm which computes a point y G L satisfying 

\\y - x\\c < (1 + e)d c (L,x) 

with probability at least 1 — 2~ n . Furthermore, ifdc(L, x) < tX\(C, L), for t > 2, then an exact closest 
vector can be found in time 0(^) n with probability at least 1 — 2~ n . 

The above algorithm adapts the AKS sieve to work for general semi-norms. As mentioned previously 
MBN071 gave the above result for i v norms, and HAJ08B gave a 2°( n )-time exact SVP solver for all norms 
(also implied by the above since SVP < CVP, see MGMSS99I1 ). In MDPV11II . a Las Vegas algorithm (where 
only the runtime is probabilistic, not the correctness) is given for the exact versions of the above results 
(i.e. where an exact closest / shortest vector is found) with similar asymptotic complexity using completely 
different techniques, 

Hence compared with previous results, the novelty of the above algorithm is the extension of the AKS 
sieving technique for (1 + e)-CVP in general semi-norms. As seen from Theorems [Tj] and 11.21 the signif- 
icance of this extension is in its direct applications to IP. Furthermore, we believe our results illustrate the 
versatility of the AKS sieving paradigm. 

From a high level, our algorithm uses the same framework as HBN071IAJ08I . We first show that the AKS 
sieve can be used to solve the Subspace Avoiding Problem (SAP), which was first defined in MBN07I . and 
use a reduction from CVP to SAP to get the final result. The technical challenge we overcome, is finding the 
correct generalizations of the each of the steps performed in previous algorithms to the asymmetric setting. 
We discuss this further in section [3T2l 

1.4 Organization 

In section 12 we give some general background in convex geometry and lattices. In section |3~T1 we describe 
the reductions from Approximate Integer Programming to Approximate CVP as well as Approximate In- 
teger Optimization to Approximate Integer Programming. In section 13.21 we present the algorithm for the 
Subspace Avoiding Problem, and in section 1331 we give the reduction from CVP to SAP. In section 0J we 
present our conclusions and open problems. 

2 Preliminaries 

Computation Model: A convex body K C W 1 if (ao,r, i?)-centered if ao + r ^2 - K - a o + RB% , 
where is the unit euclidean ball. All the convex bodies in this paper will be (ao, r, R) -centered unless 
otherwise specified. To interact with K, algorithms are given access to a membership oracle for K, i.e. an 
oracle Ok such that Ok{x) = 1 if x G K and 0. In some situations, an exact membership oracle is difficult 
to implement (e.g. deciding whether a matrix A has operator norm < 1), in which situation we settle for a 
"weak"-membership oracle, which only guarantees its answer for points that are either e-deep inside K or 
e-far from K (the error tolerance e is provided as an input to the oracle). 



4 



For a (0, r, i?)-centered K the gauge function || • \\k is a semi-norm. To interact with a semi-norm, 
algorithms are given a distance oracle, i.e. a function which on input x returns It is not hard to 

check that given a membership oracle for K, one can compute \\x\\k to within any desired accuracy using 
binary search. Also we remember that \\x\\k < 1 x E K, hence a distance oracle can easily implement 
a membership oracle. All the algorithms in this paper can be made to work with weak-oracles, but for 
simplicity in presentation, we assume that our oracles are all exact and that the conversion between different 
types of oracles occurs automatically. We note that when K is a polytope, all the necessary oracles can be 
implemented exactly and without difficulty. 

In the oracle model of computation, complexity is measured by the number of oracles calls and arith- 
metic operations. 

Probability: For random variables X, Y G 0, we define the total variation distance between X and Y as 

drv(X,Y) = sup | Pr(X G A) - Pr(Y G A)| 

ACQ 

The following lemma is a standard fact in probability theory: 

Lemma 2.1. Let (X\, . . . ,X m ) G Sl m and (Y\, . . . ,Y m ) G Q m denote independent random variables 
variables satisfying dTv(Xi,Yi) < efori G [m]. Then 

d TV ((Xi,...,X m ),(Y 1 ,...,Y m )) < me 

Algorithms on Convex Bodies: For the purposes of our sieving algorithm, we will need an algorithm to 
sample uniform points from K. The following result of HDFK891 provides the result: 

Theorem 2.2 (Uniform Sampler). Given r] > 0, there exists an algorithm which outputs a random point 
X G K whose distribution has total variation distance at most rjfrom the uniform distribution on K, using 
at most poly(n, ln(i), ln(-P)) calls to the oracle and arithmetic operations. 

We call a random vector X G K rj-uniform if the total variation distance between X and a uniform 
vector on K is at most rj. 

Our main IP algorithm will provide a guarantee with respect to the barycenter of K. The following 
lemma allows us to approximate a point near b(K) with overwhelming probability: 

Lemma 2.3 (Approx. Barycenter). For e > 0, let b = J2iLi N = 72~> c > an absolute constant, 
and where Xi, . . . , Xn are iid 4 _n -uniform samples on K C W 1 . Then 

Vv[\\±{b-b{K))\\ K _ b{K) >e]<2- n 

Lattices: An n-dimensional lattice L C W 1 is formed by integral combinations of linearly independent 
vectors b±, . . . ,b n G M. n . Letting B = (pi, . . . , b n ), for a point x G K n we define the modulus operator as 

x mod B = B{B~ l x - [B~ 1 x\ ) 

where for y G M n , [y\ = {[yi\, • • • , [yn\)- We note that x mod B G B[0, l) n , i.e. the fundamental 
parallelipiped of B and that x — (x mod B) G L, hence x mod B is the unique representative of the coset 
x + LinB[0, l) n . 
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Convex Geometry: The following lemma provides us some simple estimates on the effects of recentering 
the semi-norm associated with a convex body. 

Lemma 2.4. Take x,y E K satisfying || ± (x — y)\\K-y < ex < 1. Then for z E M n we have that 
1. z E T-KT + (1 — r)y 44> ||z — y\\K-y < t 
2- || 2 - y\\i<- y < \\y - x\\ K ~x + u \l-\\z- x\\ K -x\ 

3. \\z - x\\ K -x < \\z ~ y\\K-y + 13^ |1 - \\Z - y\\K-y\ 

The following theorem of Milman and Pajor, tells us that K — b(K) is ^-symmetric. 

Theorem 2.5 ( HMP00I1 ). Assume b(K) = 0. Then vol(K n -K) > ± vol(K). 

Using the above theorem, we give a simple extension which shows that near-symmetry is a stable prop- 
erty 

Corollary 2.6. Assume b(K) = 0. Then for x G K we have that K — x is ^(1 — \\x\\K)symmetric. 

3 Algorithms 

3.1 Integer Programming 

We describe the basic reduction from Approximate Integer Programming to Approximate CVP, as well as 
the reduction from Approximate Integer Optimization to Approximate Integer Programming. 

Proof of Theorem \Ll\ ( Approximate Integer Programming). We are given < e < ^, and we wish to find 
a lattice point in (1 + e)K — eb{K) n L or decide that K D L = 0. The algorithm, which we denote by 
ApproxIP(.ff, L, e), will be the following: 

Algorithm: 

1. Compute b G K, satisfying || ± (b — b{K))\\ K _ h ^ < |, using Lemma |23l (see details below). 

2. Compute y G L such that y is 1 + ^ approximate closest lattice vector to b under the semi-norm 
|| • \\x-b using Approx-CVP (Theorem 13.71) . 

3. Return y if y G \\y - b\\ K _ b < 1 + ^, and otherwise return "EMPTY" (i.e. K n L = 0). 

Correctness: Assuming that steps (1) and (2) return correct outputs (which occurs with overwhelming 
probability), we show that the final output is correct. 

First note that if \\y — b\\x~b < 1 + X' tnen ^ Lemma [2~4l we have that 

\\y - b{K)\\ K _ b{K) < \\y - b\\ K _ b + i |1 - \\y - b\\ K . h \ <l + ^ + ^ = l + e 

as required. Now assume that K n L / 0. Then we can take z G L such that \\z — b\\x^b < 1- Since y is 
a 1 + j closest vector, we must have that \\y — 6||^_;, < 1 + ^. Hence by the reasoning in the previous 
paragraph, we have that \\y — b{K)\\ K _ b ^ < 1 + e as needed. 
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For the furthermore, we assume that j^K + jr^b(K) n L 7^ 0. So we may pick z G L such that 
|z — 6(i4T) b(i^) < jt^- By Lemma [2~4l we have that 

Ik - Hx-b < W* - KK)\\K-b(K) + j^T I 1 - Ik - < j^- e + ^ 1^7 = T^f 



1*4* ~ 

Next by the assumptions on y, we have that ||y — &||a'-& < T+f + if) — ^ smce < 6 ^ \- Hence 
y £ K Pi L as needed. 

Runtime: For step (1), by Lemma 12/31 we can compute b G K, satisfying || ± (b — b{K))\\ K _ b ^ < |, 
with probability at least 1 — 2~ n , by letting b be the average of 0(n 2 ) 4 _n -uniform samples over K. By 
Theorem 12. 2[ each of these samples can be computed in poly(n, ln(^)) time. 

For step (2), we first note that by Corollary I2.6L K — b is (1 — = \- symmetric. Therefore, 
the call to the Approximate CVP algorithm, with error parameter ^ returns a valid approximation vector 
with probability at least 1 — 2~ n in time 0(3(^) 2 ) n = 0(l/e 2 ) n . Hence the entire algorithm takes time 
0(l/e 2 ) n and outputs a correct answer with probability at least 1 — 2 _n+1 as needed. □ 

Proof of Theorem\L2\( Approximate Integer Optimization). We are given v G M n , < e < i, and 5 > 
where we wish to find a lattice point in if + e{K — K) n L whose objective value is within an additive 5 
of the best point in K n L. We remember that K is (ao, r, it!) -centered. Since we lose nothing by making 
5 smaller, we shall assume that 5 < \\v\\2r. We will show that Algorithm [Qcorrectly solves the optimization 
problem. 

Algorithm 1 Algorithm ApproxOPT(Jf, L,v,e,5) 

Input: (ao, r, i?)-centered convex body K C W 1 presented by membership oracle, lattice L C IR n given by 

a basis, objective v G W l , tolerance parameters < e < ^ and 5 > 
Output: "EMPTY" if K n L = or z G K + e{K — K) Ci L satisfying snp yeKnL {v, y) < {v, z) + 5 

1: z <— ApproxIP(ET, L, e) 

2: if z = "EMPTY" then 

3: return "EMPTY" 

4: Compute xi,x u G If using the ellipsoid algorithm satisfying mf x£ K {v, x) > {v,xi) — and 

su Px£K ( v i x ) < (v,Xu) + n 

Set / <s— (f , z) and u <— (u, x u ) + ^ 
while u - I > 5 do 

771 <— i(u + /) 



y ^ApproxIP(K n {x G R n : m < (v, x) <u}, L, e) 
if y = "EMPTY" then 

u <— m 

y ^ApproxIP(if n {x G W 1 : I < (v, x) < m}, L, e) 
if y = "EMPTY" then 
Set u <— I and y <— z 
if (v, z) < (v, y) then 
Set z <— y and I <— (v, z) 
return z 
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Correctness: Assuming that all the calls to the ApproxIP solver output a correct result (which occurs with 
overwhelming probability), we show that Algorithm Q] is correct. As can be seen, the algorithm performs a 
standard binary search over the objective value. During iteration of the while loop, the value u represents the 
current best upper bound on sup yeKnL {v, y), where this bound is achieved first by bounding sup^g^- (v, x) 
(line 5), or by showing the lattice infeasibility of appriopriate restrictions of K (line 10 and 13). Similarly, 
the value I represents the objective value of the best lattice point found thus far, which is denoted by z. Now 
as long as the value of z is not null, we claim that z G K + e(K — K). To see this note that z is the output 
of some call to Approx IP, on K a>b = K n {x 6 M. n : a < (v,x) < b} for some a < b, the lattice L, with 
tolerance parameter e. Hence if z is none null, we are guaranteed that 

z € (1 + e)K a>b - eb(K a , b ) = K a , b + e{K a , b - b{K a , b )) 

C K a , b + e(K a , b - K a , b ) Q K + e(K — K) 

since b(K ab ) C K ab C K. Therefore z G K + e(K — K) as required. Now, the algorithm returns if 
"EMPTY" if K n L = (line 3), or z if u — I < 5 (line 17). Hence the algorithms output is valid as required. 



Runtime: Assuming that each call to ApproxIP returns a correct result, we first bound the number of of 
iterations of the while loop. After this, using a union bound over the failure probability of ApproxIP, we get 
a bound on the probability that the algorithm does not perform as described by the analysis. 

First, we show that gap u — I decreases by a factor of at least | after each iteration of the loop. Note that 
by construction of m, if K m)U is declared "EMPTY" in line 8, then clearly u — I decreases by | in the next 
step (since u becomes m). Next, if a lattice point y is returned in line 8, we know by Equation (13.11 ) that 
y G K m ,u ~ t{K m ,u ~ K m ^ u ). Therefore 

(v,y) > inf (v,y) — e sup (v,y) — inf (v , y) \ > m — e(u — m) (3.2) 

x&K m ,u \x€K mu ®£ K r> 



^ m.v 



Since m = |(Z + u), and e < |, we see that 

113 

u — (v, y) < (u — m) + e(u — m) < —(u — I) + —(u — I) = —(u — I) 

as needed. From here, we claim that we perform at most [ m ( tojff^ / ^ n (|)l iterations of the for loop. Now 
since K C a + RB%, note that the variation of v over K (max minus min) is at most 2i2||u||2. Therefore, 
using Equation (13.21 ). the initial value of it — I (line 6) is at most 

2R\\v\\ 2 + e{2R\\v\\ 2 ) + A < 2R\\v\\ 2 + ^{2R\\v\\ 2 ) + < 4fl||t;|| 2 

Since u — I decreases by a factor at least | at each iteration, it takes at most [ln( 4fi |"^ 2 )/ In |] iterations 
before u — I < 5. Since we call ApproxIP at most twice at each iteration, the probability that any one 
of these calls fails (wherupon the above analysis does not hold) is at most 2 [ln( 4-R |"^ 2 )/ In |] F, where F 
is the failure probability of a call to ApproxIP. For the purposes of this algorithm, we claim that the error 
probability of a call to ApproxIP can be made arbitrarily small by repetition. To see this, note any lattice 
vector returned by ApproxIP(A' a b , L, e) is always a success for our purposes, since by the algorithm's 
design any returned vector is always in K a)b + e(K ajb — K a>b ) n L (which is sufficient for us). Hence 
the only failure possibility is that ApproxIP returns that K a b n L = when this is not the case. By the 
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guarantees on ApproxIP, the probability that this occurs over k independent repetitions is at most 2 . 
Hence by repeating each call to ApproxIP 0(1 + ^ In In ^W.) times, the total error probability over all 
calls can be reduced to 2~ n as required. Hence with probability at least 1 — 2 _n , the algorithm correctly 
terminates in at most [ln( 4-R |"^ 2 ) / In |] iterations of the while loop. 

Lastly, we must check that each call to ApproxIP is done over a well centered K a ^, i.e. we must be able 
to provide to ApproxIP a center a' Q G K n and radii r', R' such that a' Q + r'B^ C K a fi C a' + R'B%, where 
each of o' , r', R' have size polynomial in the input parameters. Here we can show that appropriate convex 
combinations of the points X[ , x u (line 4) and ao G K allow us to get well-centered points inside each K mjU 
(line 8) and K^ m (line 11). For simplicity in the presentation, we delay this discussion until the full version 
of the paper. 

Given the above, since we call ApproxIP at most twice in each iteration (over a well-centered convex 
body), with probability at least 1 — 2~ n the total running time is 0(\) n polylog(i?, r, S, \\vW2) as required. 

□ 

3.2 Subspace Avoiding Problem 

In the following two sections, CCK" will denote be a (0, r, i?)-centered 7-symmetric convex body, and 
L C W 1 will denote an n-dimensional lattice. 

In this section, we introduce the Subspace Avoiding Problem of HBN071 . and outline how the AKS sieve 
can be adapted to solve it under general semi-norms. We defer most of the analysis to the full version on the 
paper. 

Let M C M n be a linear subspace where dim(M) = k < n — 1. Let X(C,L,M) = inf xeL \ M \\x\\c- 
Note that under this definition, we have the identity Ai(C, L) = A(C, L, {0}). 

Definition 3.1. The (1 + e) -Approximate Subspace Avoiding Problem with respect C, L and M is to find a 
lattice vector y £ L\M such that \\y\\ c = (1 + e)A(C, L, M). 

For x G R n , let \\x\\q = min{||x||c, IMI-c}- For a point x G W 1 , define s(x) = 1 if \\x\\c < IMI-c 
and s(x) = —1 if \\x\\c > \\x\\~c- From the notation, we have that \\x\\q = ||^|| s (a;)C = ll s ( a; ) a:; llc- 

We begin with an extension of the AKS sieving lemma to the asymmetric setting. The following lemma 
will provide the central tool for the SAP algorithm. 

Lemma 3.2 (Basic Sieve). Let (x±, y\), (X2, 2/2)) ■ ■ ■ , (%N,yN) G M n x R n denote a list of pairs satisfying 
yi — Xi G L, WxiWc < (3 and \\yi\\c < D Vi G [N]. Then a clustering, c : {1, . . . , N} — > J, J C [N], 
satisfying: 

/5\ n 1 
1. I J| < 2 I -J 2. \\yi - y c{i) + x c ^)\\* c < -D + f3 3. yi - y c(i) + x c{i) - x { G L 

for all i G [N] \ J, can be computed in deterministic 0(N )-time. 
Proof. 

Algorithm: We build the set J and clustering c iteratively, starting from J = 0, in the following manner. 
For each i G [N], check if there exists j G J such that \\yi — yj\\ s ( Xj )c < If suc h a 3 exists, set c(i) = j. 
Otherwise, append i to the set J and set c(i) = i. Repeat. 
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Analysis: We first note, that for any i, j G [N], we have that m — yj + Xj — X{ = (m — Xj) — (yj — Xj) G L 
since by assumption both yi — Xi,yj — Xj G L. Hence, property (3) is trivially satisfied by the clustering c. 

We now check that the clustering satisfies property (2). For i G [N] \ J, note that by construction we 
have that \\yi — y c (i)\\sC < y where s = s(x c ^). Therefore by the triangle inequality, we have that 



\vi - y c (i) + x c (i)\\*c < \\ui - y c (i) + x c (i)\\sc < hi - y c (i)\\sc + \\x c (i)\\sc 

D 
~2 



\vi - y c (i)\\sc + lk c (i)llc < + P 



as required. 

We now show that J satisfies property (1). By construction of J, we know that for i, j G J, i < j that 
11% - yi\\s(xi)C > §• Therefore we have that 

\\Vj -yi\\s(xi)C > y => 11%' -ViWcn-C = \\yi - yj\\cn-c > y (by symmetry of Cn -C) 
From here, we claim that 

y i + £(Cn-C)n yj + j(Cn-C) = Q). (3.3) 

Assume not, then we may pick z in the intersection above. Then by definition, we have that 

\\Vj ~ ViWcn-C = \\(yj -z) + (z- yi)\\cn-c < \\yj - z\\cn-c + \\z - yi\\cn-c 

D D D 

= \\ z ~ yj\\cn-c + \\z - yi\\cn-C < -j + -j = 

a clear contradiction. 

For each i G [N], we have by assumption that \\yi\\* c < D 44> yi G D(C U — C). Therefore, we see that 

Vi + j(C n -c) c d(c u-c) + ^(cn -C) 

= D{(c + 1 -(cn -co) u (-c + i(c n -C))) (3.4) 
c D{(C + \c) u (-c + \(-c))) = \d(c u -c) 

From (1331) . (1341 ). and since J C [iV], we have that 

= vol({^:iG J} + f(Cn-C)) < vol(|D(CU-C)) 

vo i(f(cn-c)) " vo i(f(cn-c)) 

(f) n (vol(DC)+vol(- J PC)) /5 V 
^)%ol(DC) V7, 



as needed. 

To bound the running time of the clustering algorithm is straightforward. For each element of [N] , we 
iterate once through the partially constructed set J. Since \J\ < 2 ( - ) throughout the entire algorithm, we 



7 

5' n 



have that the entire runtime is bounded by 0(N ( - J ) as required. □ 
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Definition 3.3 (Sieving Procedure). For a list of pairs (x\,yi), . . . , (xn, Un) as in Lemma 13^21 we call an 
application of the Sieving Procedure the process of computing the clustering c : [N] — > J, and outputting 
the list of pairs (xi, yi — y c ^ + x c ^) for all i £ [N] \ J. 

Note that the Sieving Procedure deletes the set of pairs associated with the cluster centers J, and com- 
bines the remaining pairs with their associated centers. 

We remark some differences with the standard AKS sieve. Here the Sieving Procedure does not guaran- 
tee that \\y-i\\c decreases after each iteration. Instead it shows that at least one of 1 1 y-i \ \ c or 1 1 — yi \ \ c decreases 
appropriately at each step. Hence the region we must control is in fact D(C U — C), which we note is gener- 
ally non-convex. Additionally, our analysis shows that how well we can use || • \\c to sieve only depends on 
vol(C n — C) I vol(C), which is a very flexible global quantity. For example, if C = [—1, l] n_1 x [—1, 2 n ] 
(i.e. a cube with one highly skewed coordinate) then C is still ^-symmetric, and hence the sieve barely 
notices the asymmetry. 

The algorithm for approximate SAP we describe presently will construct a list of large pairs as above, 
and use repeated applications of the Sieving Procedure to create shorter and shorter vectors. 
The next lemma allows us to get a crude estimate on the value of A(C, L, M). 

Lemma 3.4. Let C C W 1 a (0, r, R)-centered convex body, L C W 1 be an n-dimensional lattice, and 
M C W 1 , dim(Af ) < n — 1, be a linear subspace. Then a number v > satisfying 

v < X(C, L, M) <T-v 
r 

can be computed in polynomial time. 

The above lemma follows directly from Lemma 4.1 of MBN0911 . They prove it for i v balls, but it is easily 
adapted to the above setting using the relationship £ ||x||2 < ||^||c < \ 1Mb (since C is (0, r, i?)-centered). 
The following technical lemma will be needed in the analysis of the SAP algorithm. 

Lemma 3.5. Takev G R™ where (3 < \\v\\ c < §/3. Define C+ = (3C(~)(v- f3C) and C~ = (fiC-v)n-pC. 
Then 

voi(c+) _ voi(c-) ny 

vol(/3C) vol(/3C) - U/ 
Furthermore, int(C+) H int(C~) = 0. 

The following is the core subroutine for the SAP solver. 

We relate some important details about the the SAP algorithm. Our algorithm for SAP follows a standard 
procedure. We first guess a value (3 satisfying /3 < A(C, L, M) < and then run Short Vectors on inputs 
C, L, M, (3 and e. We show that for this value of /3, ShortVectors outputs a (1 + e) approximate solution 
with overwhelming probability. 

As we can be seen above, the main task of the ShortVectors algorithm, is to generate a large quantity 
of random vectors, and sieve them until they are all of relatively small size (i.e. 3/3 < 3A(C, L,M)). 
ShortVectors then examines all the differences between the sieved vectors in the hopes of finding one of size 
(1 + e)A(C, L, M) in L \ M. ShortVectors, in fact, needs to balance certain tradeoffs. On the one hand, it 
must sieve enough times to guarantee that the vector differences have small size. On the other, it must use 
"large" perturbations sampled from (3{C U — C), to guarantee that these differences do not all lie in M. 

We note that the main algorithmic differences with respect to BBN071 IAJ081 is the use of a modified 
sieving procedure as well as a different sampling distribution for the perturbation vectors (i.e. over f3(C U 
— C) instead of just j3C). These differences also make the algorithm's analysis more technically challenging. 
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Algorithm 2 ShortVectors(C,L,M,/3,e) 



Input: (0, r, R) -centered 7-symmetric convex body C C W l , basis B G Q nxn for L, linear subspace 
M C R n , scaling parameter j3 > 0, tolerance parameter < e < | 

1: D <— nmaxi<j< n \\Bi\\c 

2: N <- 4[61n (f )l (p)" + 8 ( r, <- ^ 

3: Create pairs (x§, y?),(x^, y°), • • • >( x n ' ^N ) as f°ll° ws: for eacn * £ [ N o], 

compute X an ^-uniform sample over /3C (using Theorem 12.21 ) and a uniform s in {— 1, 1}, 
and set x? <— and y? <— x? mod 5. 

4: t <r- 

5: while D > 3/3 do 

6: Apply Sieving Procedure to (x* , y* ) , . . . , (x* Nt , y* Nt ) yielding (x^ 1 , y'^ 1 ) , . . . , (x^ , y*+ x +i ) 

7: L> <- § + /? and t <- t + 1 

8: return {y^ - x* - (y* - x*) : i,j G [N t ]} \ M 



Theorem 3.6 (Approximate-SAP). For < e < | Zaffice vector y G L \ M swc/z f/iaf ||y||c < (1 + 
e)A(C, L, M) can Z?e computed in time 0(;p^) n w/J/z probability at least 1 — 2 _n . Furthermore, if 
A(C, L, M) < t\i(C, L), t > 2, a vector y G L\M satisfying \\y\\c = A(C, L, M), can &e wz'z7i computed 
in time O with probability at least 1 — 2~ n . 

Proof. 

Algorithm: The algorithm for (1 + e)-SAP is as follows: 

1. Using Lemma l3T4l compute a value u satisfying v < A(C, L, M) < 2 n ^u. 

2. For each i G 0, 1,... , [ln(2 n .R/r)/ln(3/2)"|, let P = (3/2)V and run Short Vectors (C, L, (3, e). 

3. Return the shortest vector found with respect to || • ||c in the above runs of Short Vectors. 

Preliminary Analysis: In words, the algorithm first guesses a good approximation of A(C, L, M) (among 
polynomially many choices) and runs the ShortVectors algorithm on these guesses. By design, there will be 
one iteration of the algorithm where j3 satisfies j3 < X(C,L, M) < |/3. We prove that for this setting of j3 
the algorithm returns a (1 + e) -approximate solution to the SAP problem with probability at least 1 — 2~ n . 

Take v G L\M denote an optimal solution to the SAP problem, i.e. v satisfies \\v\\c = A(C, L, M). 
We will show that with probability at least 1 — 2~ n , a small pertubation of v (Claim 4) will be in the set 
returned by ShortVectors when run on inputs C,L,M,j3, and e. 

Within the ShortVectors algorithm, we will assume that the samples generated over (5C (line 3) are 
exactly uniform. By doing this, we claim the probability that ShortVectors returns a (1 + e)-approximate 
solution to the SAP problem by changes by at most 2~^ n+1 \ To see this, note that we generate exactly No 
such samples, all of which are ^-uniform. Therefore by Lemma I2TTI we have that the total variation distance 
between the vector of approximately uniform samples and truly uniform samples is at most NqI] = 2~( n+1 \ 
Lastly, the event that ShortVectors returns (l+e)-approximate solution is a random function of these samples, 
and hence when switching uniform samples for n-uniform ones, the probability of this event changes by at 
most 2~( n+1 ). Therefore to prove the theorem, it suffices to show that the failure probability under truly 
uniform samples is at most 2~( n+1 \ 
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In the proof, we adopt all the names of parameters and variables denned in the execution of Short Vector. 
We denote the pairs at stage t as (x*,yf), . . . , (x^y , y^ ). We also let C+, C~ be as in Lemma [3721 For 
any stage t > 0, we define the pair (x*,yf), i G [Nt], as good if x- G int(C+) U int(C~). 



Claim 1: Let Q denote the event that there are at least ^ (V\ Nq good pairs at stage 0. Then Q occurs 

with probability least 1 — e - ^ 7 ™^ . 

Let Gi = I[x° G int(C+) U int(C~)] for i G [No] denote the indicator random variables denoting 
whether (x9,y?) is good or not. Let Sj, i G [N ], denote the {—1, 1} random variable indicating whether 
x° is sampled uniformly from (3C or — f3C. Since (3 < \\v\\c < §/?, by lemma [3751 we have that 

Pr[G, = 1] > Pr(x° G int(C+)| Si = 1) Pr(s; = 1) + Pr(x° G in^C")^ = -1) Pr( S , = -1) 

i voiosc n (v - pc)) , i voi(G8C - v) n (-pc)) (r\ n 



2 voI(PC) 2 vol(-pC) ~ W 

From the above we see that #[EiIi G i] > (i)" ^o- Since the G 4 's are iid Bernoullis, by the Chernoff 
bound we get that Pr[0] = PrE 4 =i G» < \ (i)" ^o] < e-A 7 ™^ , as needed. 



Claim 2: Let T denote the last stage of the sieve (i.e. value of t at end of the while loop). Then conditioned 

on Q, the number of good pairs at stage T is at least Nq = 4 (j^j ■ 

Examine (x°, y?) for i G [No]. We first claim that ||y°||^ < D. To see this note that y? = Bz where 
z = £ _1 x° - L-B _1 x°J G [0, l) n . Hence 

n n 

||y° He 1 < lly^llc = || y^-Bj^llc < yj^HSillc < ?i max \\Bi\\ c = D 



Ki<n 

i=l i=l 



as needed. Let Z) t = max{||y*||J. : i G [Nt]}, where we note that above shows that Dq < D. By Lemma 
we know that N t > N t _i - 2 ( ^) and that A < \ D t -i + /3 for t > 1. For A > 3/3, we see that 



ln( — ) 

+ /3 < |l?t- Given the previous bounds, an easy computation reveals that T < |~ 'f ] < [61n(S)]. 

From the above, we see that during the entire sieving phase we remove at most T(2) < 2[61n(^)] 

pairs. Since we never modify the x*'s during the sieving operation, any pair that starts off as good stays good 
as long as it survives through the last stage. Since we start with at least | (j) n Nq good pairs in stage 0, we 
are left with at least 



L (1 

2 U 



good pairs at stage T as required. 



Modifying the output: Here we will analyze a way of modifying the output the ShortVectors, which will 
maintain the output distribution but make the output analysis far simpler. 
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Let w(x) = I[x G 0C] + I[x G — fiC]. Letting X be uniform in f3C and s be uniform in {—1, 1} (i.e. 
the distribution in line 3), for x G /3(C U — C) we have that 



d Pt[sX = x] = d Pr[X = x] Pr[s = 1] + d Pr[X = -x] Pr[s = -!} = - ( ^ ^ + ^ G ^ 



2 Vvol(/3C) vol(/3C) 
ii?(x) 



2vol(/3(C)) 

(3.5) 



Examine the function /„ : /3(C U -C) ->■ /3(C U -C) denned by 



x — t> : x G int(C+ 
x + u : x G int(C~ 
x : otherwise 



Since int(C+) n int(C~) = 0, it is easy to see that f v is a well-defined bijection on /3(C U — C) satisfying 
fv{fv(x)) = x. Furthemore by construction, we see that / 1) (int(C+)) = int(C~) and f v (mt(C~)) = 
int(C+). Lastly, note that for any x G f3(C U — C), that f v (x) = x mod B since f v (x) is just a lattice 
vector shift of x. 

Let F v denote the random function where 



F v (x) 



: with probability 



f v {x) : with probability w(/ " (a:)) 



W(x)+W(f v {x)) 
W(fv(x)) 

w(x)+w(f v (x)) 



Here, we intend that different applications of the function F v all occur with independent randomness. Next 
we define the function c v as 



c v {x,y) ■■ 

For any stage t > 0, define x* = ^(x*, y*) 



fv(x) : \\y- fv(x)\\* c < \\y-x\\* c 
x : otherwise 



Claim 3: For any stage t > 0, the pairs (x\ , y{ ) , . . . , (x^ t , y^ t ) , and (F v , y\ ),..., (F v (x^ ) , ) 
are identically distributed. Furthermore, this remains true after conditioning on the event Q. 

To prove the claim, it suffices to show that the pairs in (1) (x* , y^ ),..., (x^- , y^- ), and 
(2) (F v (5t\),y t 1 ), (x^yl ),•••, (x^y^) are identically distributed (both before and after conditioning 
on Q). Our analysis will be independent of the index analyzed, hence the claim will follow by applying the 
proof inductively on each remaining pair in the second list. 

The pairs in (1) correspond exactly to the induced distribution of the algorithm on the stage t variables. 
We think of the pairs in (2) as the induced distribution of a modified algorithm on these variables, where the 
modified algorithm just runs the normal algorithm and replaces (x\,y\) by (F v (c v (x\, y\)), y\) in stage t. 
To show the distributional equivalence, we show a probability preserving correspondance between runs of 
the normal and modified algorithm having the same stage t variables. 

For < k < t, let the pairs (x^, y^), i G [N^], denote a valid run of the normal algorithm through stage 
t. We label this as run A. Let us denote the sequence of ancestors of (x\ , y\) in the normal algorithm by 
{Xa k ,ya k ) for < fc < i — 1. By definition of this sequence, we have that x° Q = x\ t = ■ ■ ■ = x\. Since 
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the ShortVectors algorithm is deterministic given the initial samples, the probability density of this run is 
simply 

dPr [n ie[No] {^ = x ,}] = dPr [x° = x\) J] dPr [x° = z°] (3.6) 

ie[JVo],i^ao 

by the independence of the samples and since x° aQ = x\. Notice if we condition on the event Q, assuming 
the run A belongs to Q (i.e. that there are enough good pairs at stage 0), the above probability density is 
simply divided by Pr[C/]. 

If x\ £ int(C+) U int(C~), note that F v (c v (x\, y\)) = F v (x\) = x\, i.e. the action of F v and c v are 
trivial. In this case, we associate run A with the identical run for the modified algorithm, which is clearly 
valid and has the same probability. Now assume that x\ G C+ U C~ . In this case, we associate run A to two 
runs of the modified algorithm: A. identical to run A, C. run A with (x^ k ,y^ k ) replaced by {fv(x^ k ),y^ k ) 
for < k < t — 1. Note that both of the associated runs have the same stage t variables as run A by 
construction. 

We must check that both runs are indeed valid for the modified algorithm. To see this, note that up till 
stage t, the modified algorithm just runs the normal algorithm. Run A inherents validity for these stages 
from the fact that run A is valid for the normal algorithm. To see that run C is valid, we first note that 
f v {x° ao ) = x Q ao = y® Q mod B (B is the lattice basis), which gives validity for stage 0. By design of 
the normal sieving algorithm, note that during run A, the algorithm never inspects the contents of x k ak for 
< k < t. Therefore, if {x^ k ,y k k ) denotes a valid ancestor sequence in run A, then so does {fv(x ak ),ya k ) 
in run B for < k < t. For stage t, note that the normal algorithm, given the stage inputs of run A would 
output (x\,y\), . . . , (xN t) yN t ) f° r tne stage t variables, and that given the stage inputs of run C would 
output (f v (x\), y\), (x2, 2/2)) • • • j ( xt Nt , y% t )- Hence in run A, the modified algorithm retains the normal 
algorithms output, and in run C, it swiches it from f v (x\ ) back to x\ . Therefore, both runs are indeed valid 
for the modified algorithm. Furthermore, note that if run A is in Q, then both the stage variables of A 
and C index a good run for the normal algorithm since the pair (x„ , y ao ) is good iff {f v {Xa ), Ua ) is good. 
Hence we see that correspondance described is valid both before and after conditioning on Q. Lastly, it is 
clear that for any run of the modified algorithm, the above correspondance yields a unique run of the normal 
algorithm. 

It remains to show that the correspondance is probability preserving. We must therefore compute the 
probability density associated with the union of run A and C for the modified algorithm. Using the analysis 
from the previous paragraph and the computation in (13.6I ). we see that this probability density is 



dPr[x° = x\] Pv[F v (c v (x\,yi)) =x\] + dPr[x° Q = f v (x\)\ Pr^M/^), y\)) = x\) 

II dPr[x° = x°] (3 " 7) 

ie[N ],i^a 

On the first line above, the first term corresponds to run A which samples x\ in stage and then chooses to 
keep (x\, y\ ) in stage t, and the second term corresponds to C which samples f v (x\) in stage and chooses 
to flip (f v (x\), y\) to (x\, y\) in stage t. Now by definition of F v and c v , we have that 

Pr[F v (c v (fv(x[),y{)) = x\] = Vv[F v {c v {x\,y\)) = x[] - 



w(x[) + w(f v (x\)) 
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Therefore, using the above and Equation (13.51 ). we have that the first line of (13.71 ) is equal to 

fdPrfx -x*l+dPrfx° -fix 1 )}) W{xtl) -( | w ifv( x i)) \ MA) 

iarr L x ao x x \ tarr^ m^jj ^t) + ^y^t)) ^ 2 vol(/3C) + 2vol(/3C) ) w(x{) + w(f v (x\)) 



w ( x l) _ HPr r„0 __ti 
2vol(/3C7) - dFr i x ao-^iJ 



Hence the probabilities in Equations (13.61 ) and (13.71 ) are equal as needed. Lastly, note that when conditioning 
on Q, both of the corresponding probabilities are divided by Pr[<7], and hence equality is maintained. 



Output Analysis: 



Claim 4: Let T denote the last stage of the sieve. Then conditioned on the event Q, with probability 

at least 1 — (|) 2N ° there exists a lattice vector w G {yf — xf — (yj — xj) : i,j G [N t ]} satisfying 
(f) w G L \ M, w — v G Mni and ||u> — v\\c < e/3. Furthermore, any lattice vector satisfying (f) 
is a (1 + e) -approximate solution to SAP. 

Let (xf , yf ), . . . , (xfj t , yf T ) denote any valid instantiation of the stage T variables corresponding to a 
good run of the algorithm (i.e. one belonging to Q). Let (xf ,yf) = (c v (xf , yf), yf) for i G [Nt]- By 
claim 3, it suffices to prove the claim for the pairs (F v (xf ), yf), . . . , {F v {xfj T ),yf !T ). This follows since 
the probability of "success" (i.e. the existence of the desired vector) conditioned on Q, is simply an average 
over all instantiations above of the conditional probability of "success". 

Since our instantiation corresponds to a good run, by Claim 2 we have at least Nq good pairs in stage 
T. Since c v preserves good pairs, the same holds true for (xf ,yf), i G [Nt]. For notational convenience, 
let us assume that the pairs (xf,yf), i G [Nq] are all good. We note then that f v (%f) = xf ± v and 
f v (f v (xf)) = xfforie[N G ]. 

First, since T is last the stage, we know that ||y^||^ < 3/3 for i G [Nq]- Next, for i G [No], by definition 
of c v we have that 

\\yf - xf\\* c = mm{\\yf - xf\\* c , \\yf - f v (xf)\\* c } 

Let s = s(yf), i.e. \\yf \\^ = \\yf\\ s c- Since (xf,yf) is good at least one of —xf,—f v (xf) G j5sC. 
Without loss of generality, we assume — xf G f3sC. Therefore, we get that 

\\yf - xf\\c < hi ~ xf\\*C < hi - xf\\ s c < hfWsC + II - xf\\ sC < 3/3 + P = 4/3 (3.8) 

Let S denote the set {yf — xf : i G [Ag]}. Since xf = yf mod B, we note that S C L. Also by 
Equation (13.81 ) we have that S C 4/3(C U -C) fl L. Let ACS denote a maximal subset such that 

x + int(f/8(C n -C)) ny + int(f /3(C n -C)) = for distinct x, y G A. Since S C 4/3 (C U -C), we see 
that for x £ S 

x + |/3(C n -C) C 4/3(C U -C) + |/3(C n -C) C (4 + |) /3(C U -C) 
Therefore we see that 

vol((4 + f)/3(CU-C)) = /8 + e\ n vol(CU-C) 
1 voi(§ p(cn-C)) V e / vol(Cn-C) 

< fttlV i^2L < 2 flV < In c 

- \ e J 7 «vol(C) " V7e/ " 2 
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Since A is maximal, we note that for any x G S, there exists y G A such that 

int(x + |/3(C n -C)) n mt(y + |/3(C D -C)) ^Q&\\x- y\\cn-C < (3-9) 

Let ci, . . . , cm £ [-^g]> denote indices such that A = {y^ — xf. : 1 < i < | A|}, and let C = {cj : 1 < j < |A|}. 
For j G {1, . . . , |A|}, recursively define the sets 

/, = {i G [JV G ] : - xf) - (yj - xJ)||cn-C < \ (c U (U^/,)) (3.10) 

Given Equation 13.91 we have by construction that the sets C,I±,. . . , Im partition [iV(j]. For each j G 
{1, . . . , |A|}, we examine the differences 

5, = ±{(yf - F v (xf)) - {y T c . - F v {x T c .)) : i G /,} 

We will show that Sj fails to contain a vector satisfying (t) with probability at most (!) . First we note 
that Sj C L since yj = xj = F v (xJ) mod 1? for z G [Ag]. 

We first condition on the value of F v (xf.) which is either xf.,xf. — v or rr^ + v. We examine the 
case where F v (xf.) = xf., the analysis for the other two cases is similar. Now, for i G Ij, we analyze the 
difference 

yf - F v (xJ) - [y% - x T c .) (3.11) 

Let 8i = (yf — xj ) — (yj^ — xf.). Depending on the output of F v (xJ), note that the vector (13.1 II) is either 
(a) 8i or of the form (b) ±v + Si (since f v (%J) = %J ± v). We claim that a vector of form (b) satisfies (f). 
To see this, note that after possibly negating the vector, it can be brought to the form v ± 5, L G L, where we 
have that 

\\±8i\\c < \\±Si\\ C n-C = \Mcn-c <ep< e\(C,L,M) < X(C,L,M), (3.12) 

since i G Ij and e < \. Since 5{ G L and \\Si\\c < A(C, L, M), we must have that ±<5j G M n L. Next, 
since v £ L \ M and ±<5j G M, we have that v ± (Sj G L \ M. Lastly, note that 

||«±«5i||c < Me + \\ ± $i\\c < \(C,L,M) + eP < (1 + e)A(C, L, M) 

as required. 

Now the probability the vector in (13.111 ) is of form (b) is 



w(xf)+w(f v (xf)) - 3 



Pr[F v (xf) = UxJ)\ = ...,-rV. A %v ^ 5 



since for any x G /5(C U —C) we have that 1 < w(x) < 2. Since each i G Jj indexes a vector in Sj not 
satisfying (f ) with probability at most 1 — | = |, the probability that Sj contains no vector satisfying (t) is 

at most (|)' /r ' (by independence) as needed. 

Let Fj, j G {1, . . . , |A|}, denote the event that Sj does not contain a vector satisfying (f). Note that Fj 
only depends on the pairs (F v (xJ.),yf.) and (F v (xf),yf) for i G Ij. Since the sets I\,..., I\^\, C partition 
[Nq], these dependencies are all disjoint, and hence the events are independent. Therefore the probability 
that none of Si, ... , S|^| contain a vector satisfying (f) is at most 



-ton© -(§) *(|) 



as needed. 
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Runtime and Failure Probability: We first analyze the runtime. First, we make 0(n log— ) guesses 
for the value of A(C, L, M). We run the Short Vectors algorithm once for each such guess j3. During one 
iteration of the sieving algorithm, we first generate No 77-uniform samples from f3C (line 3). By Theorem 
12.21 this takes poly(n, In i In j3, In R, In r) time per sample. We also mod each sample by the basis B for L, 
which takes poly(|B|) time (\B\ is the bit size of the basis). Next, by the analysis of Claim 2, we apply the 
sieving procedure at most [6 In ■§] times (runs of while loop at line 5), where each iteration of the sieving 

procedure (line 6) takes at most 0(No ) time by Lemma P3721 Lastly, we return the set of differences 

(line 8), which takes at most O(Nq) time. Now by standard arguments, one has that the values D and j3 (for 
each guess) each have size (bit description length) polynomial in the input, i.e. polynomial in \B\ (bit size 
of the basis of L), n, InR, lnr. Since Nq = 0(ln(-^)(^) n ), we have that the total running time is 



1 xn 



poly(n,lni?, lnr, |JB|, Ve) O —7-5 as needed. 

We now analyze the success probability. Here we only examine the guess f3, where j3 < X(C, L, M) < 
Assuming perfectly uniform samples over (3C, by the analysis of Claim 4, we have that conditioned on 

Q, we fail to output a (1 + e) approximate solution to SAP with probability at most (|) . Hence, under 
the uniform sampling assumption, the total probability of failure is at most 

2 ^ 2 — -/2\ 2 _i 7 n N (n+1) 



+ Pr[S c ]<(jJ +e-« rxNo <2- 

by Claim 2. When switching to ^-uniform samples, as argued in the preliminary analysis, this failure 
probability increases by at most the total variation distance, i.e. by at most r/No = 2~^ n+1 \ Therefore, the 
algorithm succeeds with probability at least 1 — 2~( ra+1 ) — 2~( n+1 ) = 1 — 2~ n as needed. 



Exact SAP: Here we are given the guarantee that X(C,L,M) < t\i(C,L), and we wish to use our 
SAP solver to get an exact minimizer to the SAP. To solve this, we run the approximate SAP solver on 
C, L, M with parameter e = j, which takes 0(t 2 /7 4 ) n time. Let v € L \ M be a lattice vector satisfying 
\\v\\c = A(C, L, M). By Claim 4, with probability at least 1 — 2~ n we are guaranteed to output a lattice 
vector w £ L \ M, such that 

H^-^lc < e\(C,L,M) < (jj tX^CL) = Xi(C,L) 

However, since w — v € L and \\w — v\\c < X±(C, L), we must have that w — v = 0. Therefore w = v and 
our SAP solver returns an exact minimizer as needed. □ 



3.3 Closest Vector Problem 

In this section, we present a reduction from Approximate-C VP to Approximate-SAP for general semi-norms. 
In MBN0711 . it is shown that £ p CVP reduces to £ p SAP in one higher dimension. By relaxing the condition 
that the lifted SAP problem remain in l v , we give a very simple reduction which reduces CVP in any semi- 
norm to SAP in one higher dimension under a different semi-norm that is essentially as symmetric. Given 
the generality of our SAP solver, such a reduction is suffices. 
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Theorem 3.7 (Approximate-CVP). Take x G W 1 . Then for any e G (0, |), y £ L satisfying \\y — x\\c < 
(1 + e)dc{L,x) can be computed in time 0{-^$) n with probability at least 1 — 2 _n . Furthermore, if 
dc(L,x) < tXi(C,L), t > 2, then a vector y G L satisfying \\y — x\\c = dc(L,x) can be computed in 
time 0(^) n with probability at least 1 — 2~ n . 

Proof. To show the theorem, we use a slightly modified version of Kannan's lifting technique to reduce 
CVP to SAP. Let us define L' C R n+1 as the lattice generated by (L, 0) and (-x, 1). 

In the standard way, we first guess a value (3 > satisfying /3 < dc{L,x) < | /3. Now let C" = 
C x [-4, i]. For (y, z), y G W 1 , z£R, we have that 

||(2/>z)l|c = max { h\\c, /3z,-2/3z} 

Also, note that C'D-C = (Cn-C) x [-^, 4]. Now we see that vol (C n -C") = i vol(CTl-C) and 
vol(C') = vol(C). Therefore, we get that 

vol(C" n -C) = \ vol(C n -C) > )- 7 n vol(C) = | 7™ vol(C') 

Hence C" is 7(1 - l/ra)-symmetric. Let M = {y G ]R™ +1 : y n+i = 0}. Define m : L -> L' \ M by 
= {y — x i 1)> where it is easy to see that m is well-defined and injective. Define 

S = {y £ L : \\y — x\\ c < (1 + e)cfc(L, x)} and 5' = {y G L' \ M : |||/||' c < (1 + e)A(C", L', M)}. 

We claim that m defines a norm preserving bijection between 5 and 5'. Taking y G L, we see that 

\\ m (y)\\c> = \\(y - x,l)\\c =max{\\y-x\\ c , (3,-2(3} = \\y - x\\ c 

since (3 < dc{L,x) < \\y — x\\c by construction. So we have that ||m(y)||c/ = \\y — x\\c, and hence 
X(C',L',M) < mf y£L \\y - x\\ c = d c (L,x). Next take (y,z) G U \M, y G M n , z G R, such that 
II -2) lie*' < (1 + e)A(C", L', M). We claim that 2 = 1. Assume not, then since (y, z) G L' \ M, we must 
have that either z>2orz<— 1. In either case, we have that 

\\{y,z)\\ a =T^{\\y\\ c ,Pz,-2Pz}>max{pz,-2Pz}>2p 
Now since (3 < d c {L, x) < | /3, e G (0, ±), and that A(C", L', M) < d c (L, x), we get that 

|| (y, z)\\c< > 2/3 = (1 + i)(|9) > (1 + i)dc(L, x) > (1 + e)d c (L, x) > (1 + e)A(C", L', M) 

a clear contradiction to our initial assumption. Since z = 1, we may write y = w — x where w G L. 
Therefore, we see that 

= 11(^-^)1)110" = max{||w - x\\ c ,(3, -2(3} = ||w;-a;|| c 

since \\w - x\\ c > d c {L,x) > (3. So we have that (1 + e)X(C',L',M) > \\(y,z)\\ C ' = \\w - x\\ c > 
dc{L, x). Since the previous statement still holds when choosing e = 0, we must have that X(C, L', M) > 
dc(L, x) and hence A(C", L', M) = do(L, x). 

From the above, for y G S, we have that ||m(y)||c" = \\y — x\\c < (1 + e )dc(L,x) = (1 + 
e)A(C", L', M), and hence m(y) G 5 as needed. Next if (y, z) G S', from the above we have that z = 1, 
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and hence (y, z) = (w — x, 1) where w G L. Therefore (y, z) = m(w), where ||to — x\\c = \\(y, z)\\c — 
(1 + e)A(C", L', M) = (1 + e)dc(L, x), and hence w G S. Since the map m is injective, we get that m 
defines a norm preserving bijection between S and S' as claimed. 

Hence solving (1 + e)-CVP with respect to C, L, x is equivalent to solving (1 + e)-SAP with respect 
to C, L', M. We get the desired result by applying 1 + e approximation algorithm for SAP described in 
theorem [331 

For exact CVP, we are given the guarantee that dc(L, x) < t\\(C, L). From analysis above, we see that 
Ai (C, ti) = min{Ai (C, L', M), inf || (y, 0) || c , } = mm{d c (L, x), Ai (C, L)} 

yeL\{0} 

Therefore 

X(C',L',M) = d c {L,x) = mm{d c {L,x),tX 1 (C,L)} < t mm{d C (L, x), Ai (C,L)} = t\i(C',L') 

Hence we may again use the SAP solver in theorem [3761 to solve the exact CVP problem in 0{^) n time 
with probability at least 1 — 2~™ as required. □ 

4 Conclusions and Open Problems 

In this paper, we have shown that an approximate version of Integer Programming (IP) can be solved via 
an extension of the AKS sieving techniques to general semi-norms. Furthermore, we give algorithms to 
solve both the (1 + e) -Approximate Subspace Avoiding Problem (SAP) and Closest Vector Problem (CVP) 
in general semi-norms using these techniques. Due to the reliance on a probabilistic sieve, the algorithms 
presented here only guarantee the correctness of their outputs with high probability. In MDPV11I . it was 
shown that this shortcoming can sometimes be avoided, by giving a Las Vegas algorithm for SVP in general 
norms (which is deterministic for £ p norms) achieving similar asymptotic running times as the AKS sieve 
based methods. The following question is still open: 

Problem: Does there exists a Las Vegas or deterministic algorithm for (1 + e)-SAP or CVP in general 
(semi-)norms achieving the same asymptotic running time as the AKS sieve based methods? 

As for potential improvements in the complexity of (1 + e)-SAP / CVP, the following question is open: 

Problem: Can the complexity of the AKS sieve based methods for (1 + e)-SAP / CVP in general 
(semi-)norms be reduced to 0(^) n ? 
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A Appendix 

Proof of Lemma \Z~3\ (Approx. Barycenter). Let X±, . . . , Xn denote iid uniform samples over K C W a , 
where N = (^) 2 . We will show that for b = Ya=x x i> tnat tne following holds 

Pr [|| ± (b - b{K))\\ K _ b{K) > e] < 4" n (A. 1) 

Since the above statement is invariant under affine transformations, we may assume K is isotropic, i.e. 
b(K) = E[Xi] = 0, the origin, and E[XiX|] = I n , the n x n identity. Since K is isotropic, we have that 
B% C K (see IKLS951 ). Therefore to show dATTb it suffices to prove that Pr[||6|| 2 > e] < 4" n . Since the 
Xi% are iid isotropic random vectors, we see that E[6] = j? = and 

1 1 n 1 

i,je[N] i=l 

Now since the X,s are log-concave, we have that b is also log-concave (since its distribution is a convolution 
of log-concave distributions). Now, given that b has covariance matrix jjl n , by the concentration inequality 
of Paouris MPao06ll . we have that 

Pr[||6 2 || 2 > e] = Pr[||6 2 || 2 > 2c-J=] < e" 2 "' < 4"" 



as claimed. To prove the theorem, we note that when switching the X; 's from truly uniform to 4~ n uniform, 
the above probability changes by at most ^4-™ by Lemma [2~T1 Therefore the total error probability under 
4 - ™-uniform samples is at most 2~ n as needed. □ 

Proof of Lemma \Z4\ ( Estimates for semi-norm recentering). We have z £ M n , x,y G K satisfying 
(f) || ± (x — y)\\K- y < a < 1. We prove the statements as follows: 

1. 

\\z — y\\K-y < t <^ (z — y) G t{K — y) <^ z G tK + (1 — r)y as needed . 

2. Let t = \\z — x\\k-x- Then by (1), we have that z G tK + (1 — t)x. Now note that 

(l-T)(x-y) C \l-r\a{K-y) 

by assumption (f) and (1). Therefore 

z G tK + (1 - t)x = tK + (1 - r)y + (1 - r)(x - y) C tK + (1 - r)y + a|l - r|(if - y) 
= (t + a\l - t\)K + (1 - t - a|l - r\)y 

Hence by (1), we have that 

|| 2 - y\\K- y < t + a\l - t\ = \\z - x\\ K -x + a\l- \\z - x\\k- x \ 

as needed. 
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3. We first show that 

±{y-x) G -2—{K-x) 
1 — a 

By (1) and (f) we have that 

(x-y) e a{K -y)<&(x-y)- a(x - y) G a{K - y) - a(x - y) 

Oi 

& (1 - a)(x - y) G a(K - x) 44> (x - y) G (K - x) 

1 — Q 

as needed. Next since < a < 1, we have that |1 — 2a| < 1. Therefore by (f) we have that 

(1 - 2a)(y — x) G |1 — 2a\a(K - y) C a(if - y) 
since G -fT — y. Now note that 

(1 - 2a)(y - x) G a(iv~ - y) ^ (1 - 2a)(y - x) + a(y — x) G a(iT - y) + a(y - x) 

(X 

(1 - a)(y - an G a(iv~ - x) <=> (y - x) G (K - x) 

1 — a 

as needed. 

Let t = \\z — y\\x- y - Then by (1), we have that z G tK + (1 — r)y. Now note that 

z GrK +(1- r)y = tK + (1 - t)x + (1 - r)(y - x) C tK + (1 - r)x H 11 - r (if - x) 

1 — a 

= + - \1-t\)K + {1-t-- 1-t x 

1 — a 1 — a 

Hence by (1), we have that 

a a 

\\Z - x\\ K -x <T + 1 - T = \\Z - y\\K-y + Z 1 ~ \\Z ~ y\\K-y\ 

1 — a 1 — a 

as needed. 

□ 

Proof of Corollary l26l ( Stability of symmetry). We claim that (1 — ||x||x)(A' fl —K) C — x n x — A. 
Take z e Kf]-K, then note that 

||x + (1 - ||x||x)^|k < \\ x \\k + (1 - ||x||x)||2||^ < \\x\\ K + (1 - ||x||x)||z||xn-^ 

< \\x\\ K + (1 - ||a;||ftr) = 1 

hence x + (1 - ||x||i<:)(A n -K) Cif«(l - ||x||k)(A n -K) (1 K - x. Next note that 

|| - x + (1 - ||x||^)z||_x < || - x\\_ K + (1 - ||x|| A ')|k||-^ < \\x\\ K + (1 - lklk)lklkn-i<: 

< \\x\\k + (1 - \\x\\k) = 1 

hence -x + (1 - ||x||_ft-)(i , C n —K) C — (1 - ||x||,r-)(.K~ n -A) C x - if, as needed. Now we see 
that 

vol{(K - x) n (x - K)) > vol((l - \\x\\ K ){K n -if)) = (1 - ||x||^) n vol(if n —K) 
and so the claim follows from Theorem 12.5 1 □ 
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Proof of Lemma [Ol ( Intersection Lemma). Since \\v\\c < § P, we see that | G 3/4(7. Now we get that 

I + i/4/3(C n -C) C 3/4/3C + 1/4/3C = 
Furthermore since ||| — u||-c = II — lll-C = V 2 IMIc < 3 / 4 A we a l so have that | E u — 3 /4C. Therefore 

I + \p{C n -C) C (t, - ±pc) + ^S(-C) = v - PC 
We therefore conclude that 

voi(/3C n - pC)) voi(| + V4/3(Cn-C)) _ /i\ w vo i(cn -c) ny 

yo\{PC) ~ vol(C) ~W vol (C) ~ ^ 

as needed. 

For the furthermore, we remember that — inf{s > : x G sC} = sup{(x, y) : y G C*} and that 
(— C)* = — C*. Now assume there exists 2 G C+ n C~. Then x = v — pk\ = pk2 — v where k\, k 2 £ C- 
Choose y G C* such that (y, u) = ||f||c- Note that 

{y, v-Ph- (pk 2 -v)) = 2 (y, v) - P((y, h) + (y, k 2 )) = 2\\v\\ c - P((y, h) + {y, k 2 )) 

> 2\\v\\ c - P(\\ki\\c + \\k 2 \\c) > 2/3 - 2/3 = 

Since v — pk\ — (Pk 2 —v) = x — x = 0by construction, all of the above inequalities must hold at equality. 
In particular, we must have that 1 = \\ki\\c = \\k 2 \\c = (y,k\) = (y,k 2 ). Since — y G (— C)*, we know 
that 

v - PC C {x G R n : (-y, x - v) < /3} 

and since (— y, (v — pk\) — v) = /3 (y, k\) = /3, we must have that v — pk\ G dC^ . Via a symmetric 
argument, we get that pk 2 - v G dC~ . Therefore C+ n C~ C dC+ n dC" <=> int(C+) n int(C~) = 0, as 
needed. □ 
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